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Abstract 

Proof nets are a graph theoretical representation of proofs in various fragments of 
type-logical grammar. In spite of this basis in graph theory, there has been relatively 
little attention to the use of graph theoretic algorithms for type-logical proof search. 

In this paper we will look at several ways in which standard graph theoretic 
algorithms can be used to restrict the search space. In particular, we will provide 
an O(n^) algorithm for selecting an optimal axiom link at any stage in the proof 
search as well as an 0{kn^) algorithm for selecting the k best proof candidates. 
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ical view of grammatical theory. Advantages of this theory over other for- 
malisms include a simple, transparent theory of (A term) semantics thanks to 
the Curry-Howard isomorphism and effective learning algorit hms for inducing 
grammars from linguistic data (iBuszkowski and Pennl . ll99nh . 



Proof nets, first introduced for linear logic by iGirardI (119871 ). are a way of 
presenting type-logical proofs which circumvents the redundancies present in, 
for example, the sequent calculus by performing all logical rules 'in parallel'. 
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The only non-determinism in trying to prove a theorem consists of selecting 
pairs of axiom links. Each possible selection — if correct — will result in a 
different proof. 

However, many of these possible selections can never contribute to a proof 
net, while a naive algorithm might try these selections many times. It is the 
goal of this paper to provide algorithms for filtering out these possibilities at 
an early stage and selecting the axiom link which is most restricted, thereby 
improving the performance of proof search. 



Given that the problem we are trying to solve is known to be NP complete, 
even in the non-co mmutative case, i t would be to o much to hope for a poly- 



nomial algorithm (IKanovichl . Il99ll : iPentud . |2003| ). However, we will see an 



algorithm for computing the best possible continuation of a partial proof net 
in O(n^). 

A second aim is to develop a polynomial algorithm by means of approximation. 
If we consider only the best k axiom links, then we can find these in 0{kn^). 
When best is defined as 'having axiom links with the shortest total distance ' 
this algorithm conv erges with results on proof nets and processing ( jjohnsoru . 
19981 : lMorriil . ll998h . 



2 Proof Nets and Essential Nets 



In this section we will look at two ways of presenting proof nets for multiplica- 
tive intuitionistic linear logic (MILL) together with their correctness criteria 
and some basic properties. 



Though the results will be focused on an associative, commutative system, 
it is simpl e to enforce no n-commutativity by demanding the axiom links to 
be planar (IRoordal . Il99ll ) or by label ing, either with p airs of string positions 
( Morrilll . Il995l ) or by algebraic terms ( de Grootd . Il999l ). In order to have more 
flexibility in dealing with linguistic phenon iena, other const r aints on the cor- 
rectness of proof nets have been proposed (IMoot and Puitd . |2002| ). but given 
that the associative, commutative logic is the worst case (in the sense that 
it allows the most axiom links) with respect to other fragments of categorial 
grammars there are no problems adapting the results of this paper to other 
systems. However, we leave the question of whether it is possible to perform 
better for more restricted type-logical grammars open. 



The choice of presenting the logic with two implications which differ only in 
the order of the premisses of the links is intended to make the extensions to 
the non-commutative case more clearly visible. 
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Ah A r,A,r'^c 

T,A,B,AhC r\-A A^B ro 



A hi? T,A,T'^C r,BhA 

T,A/B,A,T'^C ^ T^A/B^ '^ 

A\-B T,A,r'^C B,rhA 

T,A,B\A,T'^C ^ T^B\A^^^ 

Table 1 

The sequent calculus for L/MILL with commutativity implicit 
— - — [Ax] [Ax] 



np\- s / {np \s) s\- s 

np,{s / {np\s))\sh s 



(s / {np \ s))\s \- np \ s s \- s 

s I {np \ s), {s I {np \ s)) \ s h s 

Fig. 1. Example sequent derivation 

2.1 Sequent Calculus 



Table [Jshows t he sequent calculus for the Lambek calculus L, first proposed by 



Lambeld (1l958l ). The commutative version, the Lambek- van Benthem calculus 



LP, is also known as the multiplicative fragment of intuitionistic linear logic 
MILL. An example sequent derivation is shown in Figure [H 



2.2 Proof Nets 



Proof nets are an economic way of presenting proofs for linear logic, which is 
particularly elegant for the multiplicative fragment. When looking at sequent 
proofs, there are often many ways of deriving essentially 'the same' proof. 
Proof nets, on the other hand, are inherently redundancy-free. 

We define proof nets as a subset of proof structures. A proof structure is 
a collection of the links shown in Table [2] which satisfies the conditions of 
Definition [H A link has its conclusions drawn at the bottom and its premisses 
at the top. The axiom link, top left of the table, has no premisses and two 
conclusions which can appear in any order. The cut link, top right of the 
table, is mentioned only for completeness; we will not consider cut links in 
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Table 2 

Links for proof structures 

this paper. A cut link has two premisses, which can appear in any order, and 
no conclusions. All other links have an explicit left premiss and right premiss. 
We also distinguish between negative (antecedent) and positive (succedent) 
formulas and between tensor (solid) and par (dotted) links. 

Definition 1 A proof structure S is a collection of links such that: 

(1) every formula is the conclusion of exactly one link, 

(2) every formula is the premiss of at most one link, formulas which are not 
the premiss of a link are called the conclusions of the proof structure, 

(3) a proof structure has exactly one positive conclusion. 

Given a proof structure, we want to decide if it is a proof net, that is, if 
it corresponds to a sequent proof. A correctness criterion allows us to ac- 
cept the proof structures which are correct and reject those which are not. In 
this section, we wil l look at the acyclicity and connectedness condition from 



Danos and Regnierl (119891 ) . which is perhaps the most well-known correctness 
condition for proof nets in multiplicative linear logic. We will look at another 
condition in the next section. 

Definition 2 For a proof structure S, a switching uj for S is a choice for 
every par link of one of its premisses. 

Definition 3 From a proof structure S and a switching uj we obtain a cor- 
rection graph ujS by replacing all par links 

B C 
A 

depending on whether u) selects the left or the right premiss of the link, by one 
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np7 



Si np-j 



«i \io 



\n S5 



/l2 S3 



\l3 S6 

Fig. 2. Example proof frame 



0/ the following links. 



Si S5 Sg 



Si 
S2 
S3 



B C 



B C 



Theorem 4 (iDanos and Regnier (Il989h ;) ^ sequent C is provable in 
MILL iff all correction graphs of the corresponding proof structure are acyclic 



d connected, ie. it is a proof net. 



an 



Proof search in a proof net system is a rather direct reflection of the definitions. 
Given a sequent F h C we unfold the negative formulas in F and the positive 
formula C, giving us a proof frame. Note that given a polarized formula, 
exactly one link will apply, making this stage trivial. An example proof frame 
for the sequent 



{up \ s), (s / {up \s))\s\- s 

of Figure [1] is given in Figure [2l We have given the atomic formulas an index as 
subscript only to make it easier to refer to them; the numbers are not formally 
part of the logic. The matrix next to the proof frame in the figure represents 
the possible linkings: the rows are the negative formula occurrences, whereas 
the columns are the positive formula occurrences. White entries represent 
currently impossible connections whereas colored entries represent the current 
possibilities. 

The next stage consists of transforming the proof frame into a proof structure 
by linking atomic formulas of opposite polarity. It is this stage which will 
concern us in this paper. This is a matter of putting exactly one mark in 
every row and every column of the table. Figure [3] shows one of the 6 possible 
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Fig. 3. Example proof net 
linkings next to the matrix which corresponds to it. 

Finally, we need to check the correctne ss conditi o n. Th ough there are many 
correction graphs for a proof structure, iGuerrinil (1l999l ) shows we can verify 
the correctness of a proof structure in linear time. The proof structure in 
Figure [3] is indeed a proof net, which we can verify by testing all correction 
graphs for acyclicity and connectedness. Of the 5 alternative linkings, only 
one other produces a proof net. 



2.3 Essential Nets 



For out current pur poses, we are inte rested in an alternative correctness cri- 
terion proposed by iLamarchd (jl994j ). This criterion is based on a different 
way of decomposing a sequent, this time into a directed graph, with condi- 
tions on the paths performing the role of a correctness criterion. A net like 
this is called an essential net. The links for essential nets are shown in Ta- 



ble 3 on the next page though we follow Ide Grootd (119991 ) in reversing the 
arrows of iLamarchd (Il994j ). 



Definition 5 Given an essential net S its positive conclusion is called the out- 
put of the essential net and the negative conclusions, as well as the negatives 
premisses of any positive / or\ link, are called its inputs. 

Definition 6 An essential net is correct iff the following properties hold. 

(1) it is acyclic, 

(2) every path from the negative premiss of a positive / or \ link passes 
through the conclusion of this link, 

(3) every path from the inputs of the graph passes reaches the output of the 
graph. 
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Table 3 

Links for essential nets 
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Fig. 4. Example essential net 
Theorem 7 (Lamarche ( 19941 )) A sequent T \- C is provable in MILL iff 



its essential net is correct. 



Condition (1) reflects the acyclicity condition on proof nets, whereas condi- 
tions (2) and (3) reflect the connectedness condition. The formulation of 'every 
path' exists only to ensure correctness of the negative • link; in all other cases 
there is at most one path between two points in a correct essential net. 

Figure H] gives the essential net corresponding to the proof frame we have seen 
before, but this time with the np axiom link already performed. Remark that 
we have simply unfolded the formulas as before, just with a different set of 
links. 

It will be our goal to eliminate as many axiom links as possible for this exam- 
ple. 

Though the correctness criterion was original ly formulated for the mult iplica- 
tive intuitionistic fragment of linear logic only. iMurawski and One (l2000f ) show 



in addition to giving a linear time algorithm for testing the correctness of 
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an essential net — that we can transform a classical proof net into an essential 
net in linear time. So our results in the following sections can be applied to 
the classical case as well. 



2.4 Basic Properties 

In order to better analyze the properties of the algorithms we propose, we will 
first take a look at some basic properties of proof nets. 

2.4-1 Axiom Links 

Since we will be concerned with finding an axiom linking for a partial proof 
structure V which will turn V into a proof net, we first given some bounds 
on the number of proof structures we will have to consider. Given that the 
problem we are trying to solve is NP complete, it is not surprising these bounds 
are quite high. 

Proposition 8 Let V be a proof net and f an atomic formula, then the num- 
ber of positive occurrences of f is equal to the number of negative occurrences 
off. 

This proposition follows immediately from the fact that every atomic formula 
is the conclusion of an axiom link, where each axiom link has one positive and 
one negative occurrence of a formula / as its conclusion. 

Proposition 9 Every proof frame T has 0{a\) axiom Unkings which produce 
a proof structure, where a is the m,aximum number of positive and negative 
occurrences of an atomic formula in T . 

If we have a positive atomic formulas, we have a possibilities for the first one, 
since all negative formulas may be selected, followed by a — 1 for the second 
etc. giving us a! possibilities. 

Proposition 10 Every proof frame T has 0(4") planar axiom Unkings which 
produce a proof structure, where a is the maximum number of positive and 
negative occurrences of an atomic formula in T . 

This follows from the fact that a planar axiom linking is simply a binary brack- 
eting of the atomic formulas and the fact that there are Ca-\ such bracketings, 
where C^, the /cth Catalan number, approaches ^ j^Jiik^l'^ . 

Proposition 11 For every partial proof structure with a atomic formulas 
which are not the conclusion of any axiom link there are O(a^) possible axiom 
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links. 



Given that every positive atomic formula can be linked to every negative 
atomic formula of the same atomic type this gives us pairs. 

2.4-2 Graph Size 

Proposition 12 For every proof structure S with h negative conclusions, 1 
positive conclusion, p par links and t tensor links, the following equation holds. 

p+h=t+l=a 

Given Proposition [8|, the number of positive and negative atomic formulas is 
both a. Suppose we want to construct a proof structure S with h negative 
conclusions and 1 positive conclusion from these atomic formulas. When we 
look at the links in Table [2] we see that all par links reduce the number of 
negative conclusions by 1 and all tensor links reduce the number of positive 
conclusions by 1. 

Proposition 13 Every essential net S has v = h + l + 2{t+p) = 0{a) vertices 
and 2t + p < e < 2(t + p) + a = 0{a) edges. 

This follows immediately from inspection of the links: all conclusions of the 
essential net {h negative and 1 positive) start out as a single vertex and every 
link adds two new vertices. For the edges: the minimum number is obtained 
when we have no axiom links and all par links are positive links for \ or / 
which introduce one edge, the maximum number includes a axiom links and 
par links which are all negative links for •. 

Corollary 14 An essential net is sparse, ie. the number of edges is propor- 
tional to the number of vertices, but if we add edges for all possible axiom links 
it will be dense, ie. e is proportional to v"^. 

Immediate from Proposition [H] and Proposition [131 



3 Acyclicity 

We begin by investigating the acyclicity condition, condition (1) from Defini- 
tion El which is the easiest to verify. 

In order to select the axiomatic formula which is most constrained with respect 
to the acyclicity condition we can simply enumerate all possible axiom links 
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Fig. 5. Eliminating node c from the path from a to 6 
and reject those which produce a cycle. 

We can easily verify whether a graph contains a cycle in time proportional 
to the representation of the graph, v + e, us ing either breadth-first search 



or depth-first search (e.g. 
tively), giving us an 0(a^(v + e)) 



Gormen et al.l . Il990l . Section 23.2 and 23.3 respec- 



O(a^) algorithm for verifying all pairs. 



However, this means we will visit the vertices and edges of the graph many 
times. It is therefore a practical improvement to compute the transitive closure 
of the graph in advance, after which we can perform the acyclicity queries in 
constant time. 



In this paper we will use the Floyd- Warshall algorithm ( Gormen et al. . 19901 ) 



for computing the transitive closure, which computes the transitive closure of 
a directed graph in 0{v^) time. Though there are algorithms which perform 
asymptotically better for sparse graphs, it is hard to beat this algorithm in 
practice even for sparse graphs because of the small constants involved, while 
for dense graphs, which we will consider in the next section when we take al l 
possi ble axiom links into account, it is the algorithm of choice (jSedgewickl . 
200lh . 



The Floyd- Warshall algorithm is based on successively eliminating the inter- 
mediate vertices c from every path from a to b. Given a vertex c and the paths 
a — > c — > 6 for all a and b we create a direct path a —> 6 if it didn't exist 
before. That is to say, there is a path from a to 6 if either there is a path from 
a to c and from c to 6 or if there is a path from a to 6 which we already knew 
about (Figure [5]). 



path{a, b) := path{a, b) V {path{a, c) A path{c, b)) (1) 

After eliminating c, for every path in the original graph which passed through c 
there is now a shortcut which bypasses c. After we have created such shortcuts 
for all vertices in the graph it is clear that the resulting graph has an edge 
a — >■ 6 iff there is a path from a to 6 in the original graph. 



Figure 6 on the next page shows the essential net of Figure |4 on page 7 



m 



adjacency matrix representation (left of the figure) and its transitive closure 
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Fig. 6. Initial graph (left) and its transitive closure (right) 

(right of the figure). A square in the matrix is colored in ifi^ there is a link from 
the row to the column in the graph. 

The relevant part of the graph for the acyclicity test is marked by a square 
around columns 1 — 3 of row 4 — 6. We see here, for example, that given the 
existence of a path 4 ^ 1 an axiom link between si and would produce a 
cycle (via node 10 in the original graph) and is therefore to be excluded. A 
similar observation can be made for and S3. 



4 Connectedness 



Verifying conditions (2) and (3) from Definition [6] is a bit harder. The question 
we want to ask about each link is: does this link contribute to a connected 
proof structure? Or, inversely, does excluding the other possibilities for the two 
atomic formulas we connect mean a connected proof structure is still possible. 

To check the conditions we need to verify the following: 

(1) for every negative input of the net we verify there exists a path to the 
positive conclusion, 

(2) for every negative • link we verify that both paths leaving from it reach 
their destination, 

(3) for every positive / or \ link we check the existence of a path from its 
negative premiss to its positive conclusion continuing to the positive con- 
clusion of the essential net. 

Given that we are already computing the transitive closure of the graph for 
verifying acyclicity, we can exploit this by adding additional information to the 
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Fig. 7. Initial graph (left) and its transitive closure (right) 

matrix we use for the transitive closure. There are many ways of storing this 
extra information, the simplest being in the form of an ordered list of pairs. 
Given that, for a atomic formulas, each possible connection allows (a — 1)^ 
other connections (ie. it is agnostic about all possibilities not contradicting 
this one) but excludes 2(a — 1) possibilities, it is more economic to store the 
connections which are excluded. For example, the ordered set associated to 
the edge from 1 to 4 will be {1 — 5, 1 — 6, 2 — 4, 3 — 4}, meaning "there is an 
edge from 1 to 4 but not to anywhere else and the only edge arriving at 4 
comes from 1" . 



Note that in the description of the Floyd- Warshall algorithm, we made use 
only of the logical 'and' and 'or' operators. For ordered sets, the corresponding 
operations are set union and set intersection. For eliminating vertex c from a 
path from a to 6, we first take the union of the ordered set representing the 
links which are not in a path from a to c with that representing the links not 
in a path from c to 6 (any vertex in either set couldn't be in a path from a via 
c to h). Then, we take the intersection of this set with the old set associated 
to the path from a to h. 



path{a, b) := path{a, b) fl {path{a, c) U path{c, b)) (2) 

Note that Equation [2] is simply Equation [1] with both sides negated, the nega- 
tions moved inward and set union and intersection in the place of the logical 
'or' and 'and' operators. 

Given that we can implement the union and intersection operations in linear 
time with respect to the size of the input sets, the total complexity of our 
algorithm becomes 0{v^2{a — 1)) = O(a^). 
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Fig. 8. Essential net with acyclicity and connectedness taken into account 



Figure |7 on the previous page] shows the initial graph and its transitive clo- 
sure. Every entry from the previous graph is now divided into 9 subentries 
— one for each possible axiom link. We read the entry for 1 — 4 as follows: 
the first row indicate the link between 1 and 4, and thereby the absence of a 
link 1 — 5 and 1 — 6, the second row indicates the possibilities for linking 2, 
which just excludes 2—4 and the third row indicates that for 3 just the 3 — 4 
connection is impossible. Again, we have marked the table entries relevant for 
our correctness condition by drawing a black border around them. 

When we look at the transitive closure, we see that, should we choose to link 
S2 to Sg, this would make it impossible to reach vertex 6 from vertices 4, 5, 9 
or 13. Remark also that cycles need to be excluded separately. For example, 
the path from 5 to 1 in Figure [7] does not mean we need to exclude the si — S5 
axiom link. 

Figure [8] shows the proof frame of Figure H] with all constraints taken into 
account. We see that, whatever choice we make for the first axiom link, all 
other axiom links will be fixed immediately, giving us the two proofs si — S5, 
S2 — S4, S3 — Sg (ie. the proof shown in Figure [3]) and Si — Sq, S2 — s^, S3 — S4. 



5 Extensions and Improvements 



An interesting continuation of the themes explored in this paper would be to 
look at dynamic graph algorithms, where we maintain the transitive closure 
under additions and deletions of edges. This would avoid recomputing the tran- 
sitive closure from scratch after every axiom link and woul d allow us to take 



advantage of the information we have already computed. iKing and Saggert 
(119991 ) propose an algorithm with 0{n^) update time based on keeping track 
of the number of paths between two vertices, which is easy enough to adapt to 
our current scenario in the case of acyclicity tests, though it remains unclear 
if it can be adopted to check for connectedness. 
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Another improvement would be to represent the ordered sets differently. Given 
that their structure is quite regular, it may be possible to improve upon linear 
time union and intersection. However, given that for each iteration the size 
of the sets either remains the same or decreases (the principal operation in 
Equation [2] being intersection), it remains to be seen if this will result in a 
practical improvement. 

Finally, we can consider the work in the two previous sections as using a sort 
of 'lookahead' of one axiom link, which is to say we exclude all axioms links 
which, by themselves, would produce a cyclic or disconnected proof structure. 
This can be extended quite naturally to doing k axiom links at the same time, 
though each extra axiom link will multiply the required space by 0(n2), the 
required time for acyclicity by 0{n^) and the required time for connectedness 
by 0{n^) (this is relatively easy to see because we are in effect substituting 
n{n — 1) for the old value of n). 



6 Polynomial Time 



If we add weights to the different connections, the situation changes. The sim- 
plest way to add weights to our graph is to use the distance between two atomic 
formulas as their weight and prefer the total axiom linking with the least total 
weight. This choice of assigning weights is closely relate d to work on le ft-to- 
right processin g of sentences, proposed independently by I JohnsonI (119981 ) and 
Morrilll (119981 ). The claim they make is that the complexity of a phrase de- 
pends on the number of 'open' or unlinked axiom formulas a reader /listener 
will have to maintain in memory to produce a parse for this sentence. 

Finding a min imum- weight solution to this problem is known as the assign- 
ment problem. iMurtyl (119681 ) was the first to give an algorithm for generating 
the assignments in order of increasing cost. His 0{kn'^) algorithm for finding 
the k best assignments can be improved to 0{kn^), even though tests on ran- 
domly generated g raphs have shown the observed complexity to be 0{kn'^) 



(iMiller et all . Il997f ) 



Because using the distance as weight tends to favor cyclic connections, it is 
preferable to make one pass of the algorithm described in the previous section 
and assign a weight of infinity to all edges which are either cyclic or discon- 
nected. Figur e [H] shows a weighted graph corresponding to an example from 
Morrilll (jl998l ). the sentence 'someone loves everyone', which has a preferred 
reading where the subject has wide scope. Of the four readings of this sen- 
tence (two if we enforce planar ity) there is a preference for connecting si — S2, 
S3 — se, S5 — S4, with a total weight of 11, as compared to connecting si — se, 
-Ss — S4, S5 — S2, with a total weight of 19. 
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Fig. 9. Minimum Weight Linking for 'someone loves everyone' 

There is an important difference between using distance weights hke we do here 
and keeping track of the open axiom hnks hke Johnson and Morrill, which is 
that we select a linking which is best globally. It is therefore to be expected 
that we will make different predictions for some 'garden path' sentences (ie. 
sentences where a suboptimal local choice for the axiom links will be made). 

Finding an appropriate value of k and fragments of type-logical grammar for 
which this k is guaranteed to find all readings remains an interesting open 
question. Cautious people might select k = n\, given that in any type- logical 
grammar there are at most n\ links possible, to generate all readings in in- 
creasing order of complexity. It seems tempting to set k t o n^, becaus e maii y 
interesting grammar formalisms have 0{n^) complexity (jJoshi et al.l . Il99ll ). 
and find type-logical fragments for which we can show proof search using this 
strategy is complete. 



7 Conclusion 



We have seen how standard graph algorithms can be modified to aid in proof 
search for type-logical grammars by rejecting connections which can never 
contribute to a successful proof. 

We have also seen how weighing the connections allows us to enumerate the 
links in increasing order and linked this with processing claims. 
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